首页> 外文OA文献 >VLSI Implementation of Deep Neural Network Using Integral Stochastic Computing
【2h】

VLSI Implementation of Deep Neural Network Using Integral Stochastic Computing

机译:基于积分随机变量的深度神经网络VLsI实现   计算

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The hardware implementation of deep neural networks (DNNs) has recentlyreceived tremendous attention: many applications in fact require high-speedoperations that suit a hardware implementation. However, numerous elements andcomplex interconnections are usually required, leading to a large areaoccupation and copious power consumption. Stochastic computing has shownpromising results for low-power area-efficient hardware implementations, eventhough existing stochastic algorithms require long streams that cause longlatencies. In this paper, we propose an integer form of stochastic computationand introduce some elementary circuits. We then propose an efficientimplementation of a DNN based on integral stochastic computing. The proposedarchitecture has been implemented on a Virtex7 FPGA, resulting in 45% and 62%average reductions in area and latency compared to the best reportedarchitecture in literature. We also synthesize the circuits in a 65 nm CMOStechnology and we show that the proposed integral stochastic architectureresults in up to 21% reduction in energy consumption compared to the binaryradix implementation at the same misclassification rate. Due to fault-tolerantnature of stochastic architectures, we also consider a quasi-synchronousimplementation which yields 33% reduction in energy consumption w.r.t. thebinary radix implementation without any compromise on performance.
机译:深度神经网络(DNN)的硬件实现最近受到了极大的关注:实际上,许多应用程序都需要适合硬件实现的高速操作。然而,通常需要大量的元件和复杂的互连,从而导致大面积的占用和大量的功率消耗。尽管现有的随机算法需要较长的数据流并导致较长的等待时间,但随机计算已显示出对低功耗,面积效率高的硬件实现有希望的结果。在本文中,我们提出了一种随机计算的整数形式,并介绍了一些基本电路。然后,我们提出了一种基于积分随机计算的DNN的高效实现。所提议的架构已在Virtex7 FPGA上实现,与文献中报道的最佳架构相比,其面积和等待时间平均减少了45%和62%。我们还使用65 nm CMOS技术合成了电路,结果表明,在相同的错误分类率下,与二进制基数实现相比,拟议的整体随机体系结构可将能耗降低多达21%。由于随机架构的容错性,我们还考虑了准同步实现,该实现可将w.r.t.的能耗降低33%。二进制基数实现,而不会影响性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号